A Binaural Auditory Model for Missing Data Recognition of Speech in Noise
نویسندگان
چکیده
We describe a binaural auditory model for speech recognition, which is robust in the presence of reverberation and spatially separated noise intrusions. The principle underlying the model is to identify time-frequency regions which constitute reliable evidence of the speech signal. This is achieved both by determining the spatial location of the speech source, and by applying a simple model of reflection masking. Reliable timefrequency regions are passed to a missing data speech recognizer. We show, firstly, that the auditory model improves recognition performance in various reverberation conditions when no noise intrusion is present. Secondly, we demonstrate that the model improves performance when the speech signal is contaminated by noise, both for an anechoic environment and in the presence of simulated room reverberation.
منابع مشابه
مدل میکروسکوپی دوگوشی مبتنی بر فیلتر بانک مدولاسیون برای پیش گویی قابلیت فهم گفتار در افراد دارای شنوایی عادی
In this study, a binaural microscopic model for the prediction of speech intelligibility based on the modulation filter bank is introduced. So far, the spectral criteria such as the STI and SII or other analytical methods have been used in the binaural models to determine the binaural intelligibility. In the proposed model, unlike all models of binaural intelligibility prediction, an automatic ...
متن کاملA binaural processor for missing data speech recognition in the presence of noise and small-room reverberation
In this study we describe a binaural auditory model for recognition of speech in the presence of spatially separated noise intrusions, under small-room reverberation conditions. The principle underlying the model is to identify time–frequency regions which constitute reliable evidence of the speech signal. This is achieved both by determining the spatial location of the speech source, and by gr...
متن کاملA Binaural Model for Missing Data Speech Recognition in Noisy and Reverberant Conditions
We describe a binaural auditory model for speech recognition, which is robust in the presence of reverberation and spatially separated noise intrusions. The principle underlying the model is to identify time-frequency regions which constitute reliable evidence of the speech signal. This is achieved both by determining the spatial location of the speech source, and by applying a simple model of ...
متن کاملA computational model of binaural speech recognition: Role of across-frequency vs. within-frequency processing and internal noise
This study describes a model of binaural speech recognition that is tested against psychoacoustic findings on binaural speech intelligibility in noise. It consists of models of the auditory periphery, binaural pathway and recognition of speech from glimpses based on the missing data approach, which allows the speech reception threshold (SRT) of the model and listeners to be compared. The binaur...
متن کاملIdentifying the human-machine differences in complex binaural scenes: what can be learned from our auditory system
Previous comparisons of human speech recognition (HSR) and automatic speech recognition (ASR) focused on monaural signals in additive noise, and showed that HSR is far more robust against intrinsic and extrinsic sources of variation than conventional ASR. The aim of this study is to analyze the man-machine gap (and its causes) in more complex acoustic scenarios, particularly in scenes with two ...
متن کامل